Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 1460 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 148.4 KiB |
| Average record size in memory | 104.1 B |
Variable types
| NUM | 13 |
|---|---|
Warnings
| Dataset has 1 (0.1%) duplicate rows | Duplicates |
|---|---|
| MiscVal is highly skewed (γ1 = 24.47679419) | Skewed |
| 2ndFlrSF has 829 (56.8%) zeros | Zeros |
| LowQualFinSF has 1434 (98.2%) zeros | Zeros |
| WoodDeckSF has 761 (52.1%) zeros | Zeros |
| OpenPorchSF has 656 (44.9%) zeros | Zeros |
| EnclosedPorch has 1252 (85.8%) zeros | Zeros |
| 3SsnPorch has 1436 (98.4%) zeros | Zeros |
| ScreenPorch has 1344 (92.1%) zeros | Zeros |
| PoolArea has 1453 (99.5%) zeros | Zeros |
| MiscVal has 1408 (96.4%) zeros | Zeros |
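The zero percentages reported above follow directly from the zero counts and the 1460 observations. A minimal sketch of that arithmetic, using a hypothetical helper `share_percent` (not part of pandas-profiling) and a few counts copied from the report:

```python
def share_percent(count, total):
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


# "Number of observations" from the dataset statistics table.
N_OBSERVATIONS = 1460

# Zero counts copied from the report for a few of the flagged columns.
zero_counts = {
    "2ndFlrSF": 829,
    "LowQualFinSF": 1434,
    "PoolArea": 1453,
    "MiscVal": 1408,
}

zero_shares = {
    name: share_percent(count, N_OBSERVATIONS)
    for name, count in zero_counts.items()
}

for name, share in zero_shares.items():
    print(f"{name}: {share}% zeros")
```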
Reproduction
| Analysis started | 2020-11-06 21:34:56.059678 |
|---|---|
| Analysis finished | 2020-11-06 21:35:19.443380 |
| Duration | 23.38 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
LotArea
Real number (ℝ≥0)
| Distinct | 1073 |
|---|---|
| Distinct (%) | 73.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10516.82808 |
| Minimum | 1300 |
| Maximum | 215245 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 1300 |
|---|---|
| 5-th percentile | 3311.7 |
| Q1 | 7553.5 |
| median | 9478.5 |
| Q3 | 11601.5 |
| 95-th percentile | 17401.15 |
| Maximum | 215245 |
| Range | 213945 |
| Interquartile range (IQR) | 4048 |
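The quantile rows above can be reproduced with a linear-interpolation quantile, the default convention in NumPy and pandas. That the report used exactly this convention is an assumption, although fractional values such as 3311.7 are consistent with it. The sketch below uses a hypothetical toy sample for the quantile function and the report's own figures for the range and IQR arithmetic:

```python
def quantile(values, q):
    """Quantile with linear interpolation between the two nearest order statistics."""
    data = sorted(values)
    pos = q * (len(data) - 1)          # fractional index into the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(data) - 1)
    frac = pos - lower
    return data[lower] + frac * (data[upper] - data[lower])


# Hypothetical toy sample, NOT the real LotArea column.
sample = [1300, 7200, 8400, 9600, 10800, 215245]
q1 = quantile(sample, 0.25)
q3 = quantile(sample, 0.75)

# Range and IQR recomputed from the LotArea figures in the table above.
value_range = 215245 - 1300            # Maximum - Minimum
iqr_from_report = 11601.5 - 7553.5     # Q3 - Q1

print(q1, q3, value_range, iqr_from_report)
```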
Descriptive statistics
| Standard deviation | 9981.264932 |
|---|---|
| Coefficient of variation (CV) | 0.949075601 |
| Kurtosis | 203.243271 |
| Mean | 10516.82808 |
| Median Absolute Deviation (MAD) | 1998 |
| Skewness | 12.20768785 |
| Sum | 15354569 |
| Variance | 99625649.65 |
| Monotonicity | Not monotonic |
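Several of the descriptive statistics above are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below checks these against the LotArea figures from the report. The `mad` helper is a hypothetical illustration of the median absolute deviation on toy data, not the profiler's own code.

```python
from statistics import median

# Figures copied from the report (LotArea).
n_observations = 1460
total = 15354569
std = 9981.264932

mean = total / n_observations   # should match the reported Mean
cv = std / mean                 # coefficient of variation
variance = std ** 2             # variance is the squared standard deviation


def mad(values):
    """Median absolute deviation: median of |x - median(x)|."""
    values = list(values)
    center = median(values)
    return median(abs(v - center) for v in values)


print(round(mean, 5), round(cv, 6), round(variance, 2))
print(mad([1, 2, 3, 4, 100]))   # toy data; the outlier barely moves the MAD
```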
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 7200 | 25 | 1.7% | |
| 9600 | 24 | 1.6% | |
| 6000 | 17 | 1.2% | |
| 10800 | 14 | 1.0% | |
| 9000 | 14 | 1.0% | |
| 8400 | 14 | 1.0% | |
| 1680 | 10 | 0.7% | |
| 7500 | 9 | 0.6% | |
| 8125 | 8 | 0.5% | |
| 9100 | 8 | 0.5% | |
| 6120 | 8 | 0.5% | |
| 6240 | 8 | 0.5% | |
| 3182 | 7 | 0.5% | |
| 7800 | 6 | 0.4% | |
| 8450 | 6 | 0.4% | |
| 10000 | 5 | 0.3% | |
| 4500 | 5 | 0.3% | |
| 4435 | 5 | 0.3% | |
| 5000 | 5 | 0.3% | |
| 10140 | 5 | 0.3% | |
| 9750 | 5 | 0.3% | |
| 10400 | 5 | 0.3% | |
| 5400 | 5 | 0.3% | |
| 7018 | 4 | 0.3% | |
| 11700 | 4 | 0.3% | |
| Other values (1048) | 1234 | 84.5% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1300 | 1 | 0.1% | |
| 1477 | 1 | 0.1% | |
| 1491 | 1 | 0.1% | |
| 1526 | 1 | 0.1% | |
| 1533 | 2 | 0.1% | |
| 1596 | 1 | 0.1% | |
| 1680 | 10 | 0.7% | |
| 1869 | 1 | 0.1% | |
| 1890 | 2 | 0.1% | |
| 1920 | 1 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 215245 | 1 | 0.1% | |
| 164660 | 1 | 0.1% | |
| 159000 | 1 | 0.1% | |
| 115149 | 1 | 0.1% | |
| 70761 | 1 | 0.1% | |
| 63887 | 1 | 0.1% | |
| 57200 | 1 | 0.1% | |
| 53504 | 1 | 0.1% | |
| 53227 | 1 | 0.1% | |
| 53107 | 1 | 0.1% |
1stFlrSF
Real number (ℝ≥0)
| Distinct | 753 |
|---|---|
| Distinct (%) | 51.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1162.626712 |
| Minimum | 334 |
| Maximum | 4692 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 334 |
|---|---|
| 5-th percentile | 672.95 |
| Q1 | 882 |
| median | 1087 |
| Q3 | 1391.25 |
| 95-th percentile | 1831.25 |
| Maximum | 4692 |
| Range | 4358 |
| Interquartile range (IQR) | 509.25 |
Descriptive statistics
| Standard deviation | 386.587738 |
|---|---|
| Coefficient of variation (CV) | 0.3325123481 |
| Kurtosis | 5.745841482 |
| Mean | 1162.626712 |
| Median Absolute Deviation (MAD) | 234.5 |
| Skewness | 1.376756622 |
| Sum | 1697435 |
| Variance | 149450.0792 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 864 | 25 | 1.7% | |
| 1040 | 16 | 1.1% | |
| 912 | 14 | 1.0% | |
| 848 | 12 | 0.8% | |
| 894 | 12 | 0.8% | |
| 672 | 11 | 0.8% | |
| 816 | 9 | 0.6% | |
| 630 | 9 | 0.6% | |
| 936 | 7 | 0.5% | |
| 960 | 7 | 0.5% | |
| 483 | 7 | 0.5% | |
| 832 | 7 | 0.5% | |
| 764 | 6 | 0.4% | |
| 990 | 6 | 0.4% | |
| 728 | 6 | 0.4% | |
| 1056 | 6 | 0.4% | |
| 840 | 6 | 0.4% | |
| 882 | 6 | 0.4% | |
| 1728 | 6 | 0.4% | |
| 720 | 6 | 0.4% | |
| 796 | 5 | 0.3% | |
| 1494 | 5 | 0.3% | |
| 1422 | 5 | 0.3% | |
| 520 | 5 | 0.3% | |
| 1072 | 5 | 0.3% | |
| Other values (728) | 1251 | 85.7% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 334 | 1 | 0.1% | |
| 372 | 1 | 0.1% | |
| 438 | 1 | 0.1% | |
| 480 | 1 | 0.1% | |
| 483 | 7 | 0.5% | |
| 495 | 1 | 0.1% | |
| 520 | 5 | 0.3% | |
| 525 | 1 | 0.1% | |
| 526 | 1 | 0.1% | |
| 536 | 1 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 4692 | 1 | 0.1% | |
| 3228 | 1 | 0.1% | |
| 3138 | 1 | 0.1% | |
| 2898 | 1 | 0.1% | |
| 2633 | 1 | 0.1% | |
| 2524 | 1 | 0.1% | |
| 2515 | 1 | 0.1% | |
| 2444 | 1 | 0.1% | |
| 2411 | 1 | 0.1% | |
| 2402 | 1 | 0.1% |
2ndFlrSF
Real number (ℝ≥0)
| Distinct | 417 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 346.9924658 |
| Minimum | 0 |
| Maximum | 2065 |
| Zeros | 829 |
| Zeros (%) | 56.8% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 728 |
| 95-th percentile | 1141.05 |
| Maximum | 2065 |
| Range | 2065 |
| Interquartile range (IQR) | 728 |
Descriptive statistics
| Standard deviation | 436.5284359 |
|---|---|
| Coefficient of variation (CV) | 1.258034335 |
| Kurtosis | -0.5534635576 |
| Mean | 346.9924658 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8130298163 |
| Sum | 506609 |
| Variance | 190557.0753 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 829 | 56.8% | |
| 728 | 10 | 0.7% | |
| 504 | 9 | 0.6% | |
| 672 | 8 | 0.5% | |
| 546 | 8 | 0.5% | |
| 720 | 7 | 0.5% | |
| 600 | 7 | 0.5% | |
| 896 | 6 | 0.4% | |
| 780 | 5 | 0.3% | |
| 862 | 5 | 0.3% | |
| 689 | 5 | 0.3% | |
| 840 | 5 | 0.3% | |
| 756 | 5 | 0.3% | |
| 702 | 4 | 0.3% | |
| 739 | 4 | 0.3% | |
| 551 | 4 | 0.3% | |
| 741 | 4 | 0.3% | |
| 878 | 4 | 0.3% | |
| 804 | 4 | 0.3% | |
| 670 | 3 | 0.2% | |
| 660 | 3 | 0.2% | |
| 1254 | 3 | 0.2% | |
| 793 | 3 | 0.2% | |
| 668 | 3 | 0.2% | |
| 795 | 3 | 0.2% | |
| Other values (392) | 509 | 34.9% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 829 | 56.8% | |
| 110 | 1 | 0.1% | |
| 167 | 1 | 0.1% | |
| 192 | 1 | 0.1% | |
| 208 | 1 | 0.1% | |
| 213 | 1 | 0.1% | |
| 220 | 1 | 0.1% | |
| 224 | 1 | 0.1% | |
| 240 | 2 | 0.1% | |
| 252 | 2 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2065 | 1 | 0.1% | |
| 1872 | 1 | 0.1% | |
| 1818 | 1 | 0.1% | |
| 1796 | 1 | 0.1% | |
| 1611 | 1 | 0.1% | |
| 1589 | 1 | 0.1% | |
| 1540 | 1 | 0.1% | |
| 1538 | 1 | 0.1% | |
| 1523 | 1 | 0.1% | |
| 1519 | 1 | 0.1% |
LowQualFinSF
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.844520548 |
| Minimum | 0 |
| Maximum | 572 |
| Zeros | 1434 |
| Zeros (%) | 98.2% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 572 |
| Range | 572 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 48.62308143 |
|---|---|
| Coefficient of variation (CV) | 8.319430317 |
| Kurtosis | 83.23481667 |
| Mean | 5.844520548 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.011341288 |
| Sum | 8533 |
| Variance | 2364.204048 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1434 | 98.2% | |
| 80 | 3 | 0.2% | |
| 360 | 2 | 0.1% | |
| 528 | 1 | 0.1% | |
| 53 | 1 | 0.1% | |
| 120 | 1 | 0.1% | |
| 144 | 1 | 0.1% | |
| 156 | 1 | 0.1% | |
| 205 | 1 | 0.1% | |
| 232 | 1 | 0.1% | |
| 234 | 1 | 0.1% | |
| 371 | 1 | 0.1% | |
| 572 | 1 | 0.1% | |
| 390 | 1 | 0.1% | |
| 392 | 1 | 0.1% | |
| 397 | 1 | 0.1% | |
| 420 | 1 | 0.1% | |
| 473 | 1 | 0.1% | |
| 479 | 1 | 0.1% | |
| 481 | 1 | 0.1% | |
| 513 | 1 | 0.1% | |
| 514 | 1 | 0.1% | |
| 515 | 1 | 0.1% | |
| 384 | 1 | 0.1% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1434 | 98.2% | |
| 53 | 1 | 0.1% | |
| 80 | 3 | 0.2% | |
| 120 | 1 | 0.1% | |
| 144 | 1 | 0.1% | |
| 156 | 1 | 0.1% | |
| 205 | 1 | 0.1% | |
| 232 | 1 | 0.1% | |
| 234 | 1 | 0.1% | |
| 360 | 2 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 572 | 1 | 0.1% | |
| 528 | 1 | 0.1% | |
| 515 | 1 | 0.1% | |
| 514 | 1 | 0.1% | |
| 513 | 1 | 0.1% | |
| 481 | 1 | 0.1% | |
| 479 | 1 | 0.1% | |
| 473 | 1 | 0.1% | |
| 420 | 1 | 0.1% | |
| 397 | 1 | 0.1% |
GrLivArea
Real number (ℝ≥0)
| Distinct | 861 |
|---|---|
| Distinct (%) | 59.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1515.463699 |
| Minimum | 334 |
| Maximum | 5642 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 334 |
|---|---|
| 5-th percentile | 848 |
| Q1 | 1129.5 |
| median | 1464 |
| Q3 | 1776.75 |
| 95-th percentile | 2466.1 |
| Maximum | 5642 |
| Range | 5308 |
| Interquartile range (IQR) | 647.25 |
Descriptive statistics
| Standard deviation | 525.4803834 |
|---|---|
| Coefficient of variation (CV) | 0.3467456092 |
| Kurtosis | 4.895120581 |
| Mean | 1515.463699 |
| Median Absolute Deviation (MAD) | 326 |
| Skewness | 1.366560356 |
| Sum | 2212577 |
| Variance | 276129.6334 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 864 | 22 | 1.5% | |
| 1040 | 14 | 1.0% | |
| 894 | 11 | 0.8% | |
| 848 | 10 | 0.7% | |
| 1456 | 10 | 0.7% | |
| 912 | 9 | 0.6% | |
| 1200 | 9 | 0.6% | |
| 816 | 8 | 0.5% | |
| 1092 | 8 | 0.5% | |
| 1344 | 7 | 0.5% | |
| 1728 | 7 | 0.5% | |
| 987 | 7 | 0.5% | |
| 1056 | 6 | 0.4% | |
| 1224 | 6 | 0.4% | |
| 1768 | 6 | 0.4% | |
| 1494 | 6 | 0.4% | |
| 1484 | 6 | 0.4% | |
| 630 | 6 | 0.4% | |
| 1144 | 5 | 0.3% | |
| 1314 | 5 | 0.3% | |
| 960 | 5 | 0.3% | |
| 1252 | 5 | 0.3% | |
| 1710 | 5 | 0.3% | |
| 1392 | 5 | 0.3% | |
| 988 | 5 | 0.3% | |
| Other values (836) | 1267 | 86.8% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 334 | 1 | 0.1% | |
| 438 | 1 | 0.1% | |
| 480 | 1 | 0.1% | |
| 520 | 1 | 0.1% | |
| 605 | 1 | 0.1% | |
| 616 | 1 | 0.1% | |
| 630 | 6 | 0.4% | |
| 672 | 2 | 0.1% | |
| 691 | 1 | 0.1% | |
| 693 | 1 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5642 | 1 | 0.1% | |
| 4676 | 1 | 0.1% | |
| 4476 | 1 | 0.1% | |
| 4316 | 1 | 0.1% | |
| 3627 | 1 | 0.1% | |
| 3608 | 1 | 0.1% | |
| 3493 | 1 | 0.1% | |
| 3447 | 1 | 0.1% | |
| 3395 | 1 | 0.1% | |
| 3279 | 1 | 0.1% |
WoodDeckSF
Real number (ℝ≥0)
| Distinct | 274 |
|---|---|
| Distinct (%) | 18.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.24452055 |
| Minimum | 0 |
| Maximum | 857 |
| Zeros | 761 |
| Zeros (%) | 52.1% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 168 |
| 95-th percentile | 335 |
| Maximum | 857 |
| Range | 857 |
| Interquartile range (IQR) | 168 |
Descriptive statistics
| Standard deviation | 125.3387944 |
|---|---|
| Coefficient of variation (CV) | 1.329931901 |
| Kurtosis | 2.992950925 |
| Mean | 94.24452055 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.541375757 |
| Sum | 137597 |
| Variance | 15709.81337 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 761 | 52.1% | |
| 192 | 38 | 2.6% | |
| 100 | 36 | 2.5% | |
| 144 | 33 | 2.3% | |
| 120 | 31 | 2.1% | |
| 168 | 28 | 1.9% | |
| 140 | 15 | 1.0% | |
| 224 | 14 | 1.0% | |
| 240 | 10 | 0.7% | |
| 208 | 10 | 0.7% | |
| 216 | 9 | 0.6% | |
| 180 | 8 | 0.5% | |
| 160 | 8 | 0.5% | |
| 250 | 6 | 0.4% | |
| 132 | 6 | 0.4% | |
| 264 | 6 | 0.4% | |
| 143 | 6 | 0.4% | |
| 96 | 6 | 0.4% | |
| 156 | 6 | 0.4% | |
| 171 | 5 | 0.3% | |
| 48 | 5 | 0.3% | |
| 196 | 5 | 0.3% | |
| 105 | 5 | 0.3% | |
| 288 | 5 | 0.3% | |
| 210 | 5 | 0.3% | |
| Other values (249) | 393 | 26.9% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 761 | 52.1% | |
| 12 | 2 | 0.1% | |
| 24 | 2 | 0.1% | |
| 26 | 2 | 0.1% | |
| 28 | 2 | 0.1% | |
| 30 | 1 | 0.1% | |
| 32 | 1 | 0.1% | |
| 33 | 1 | 0.1% | |
| 35 | 1 | 0.1% | |
| 36 | 4 | 0.3% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 857 | 1 | 0.1% | |
| 736 | 1 | 0.1% | |
| 728 | 1 | 0.1% | |
| 670 | 1 | 0.1% | |
| 668 | 1 | 0.1% | |
| 635 | 1 | 0.1% | |
| 586 | 1 | 0.1% | |
| 576 | 1 | 0.1% | |
| 574 | 1 | 0.1% | |
| 550 | 1 | 0.1% |
OpenPorchSF
Real number (ℝ≥0)
| Distinct | 202 |
|---|---|
| Distinct (%) | 13.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.66027397 |
| Minimum | 0 |
| Maximum | 547 |
| Zeros | 656 |
| Zeros (%) | 44.9% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 25 |
| Q3 | 68 |
| 95-th percentile | 175.05 |
| Maximum | 547 |
| Range | 547 |
| Interquartile range (IQR) | 68 |
Descriptive statistics
| Standard deviation | 66.25602768 |
|---|---|
| Coefficient of variation (CV) | 1.419966538 |
| Kurtosis | 8.490335806 |
| Mean | 46.66027397 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 2.36434174 |
| Sum | 68124 |
| Variance | 4389.861203 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 656 | 44.9% | |
| 36 | 29 | 2.0% | |
| 48 | 22 | 1.5% | |
| 20 | 21 | 1.4% | |
| 40 | 19 | 1.3% | |
| 45 | 19 | 1.3% | |
| 30 | 16 | 1.1% | |
| 24 | 16 | 1.1% | |
| 60 | 15 | 1.0% | |
| 39 | 14 | 1.0% | |
| 28 | 14 | 1.0% | |
| 44 | 13 | 0.9% | |
| 50 | 13 | 0.9% | |
| 54 | 13 | 0.9% | |
| 72 | 12 | 0.8% | |
| 98 | 11 | 0.8% | |
| 63 | 11 | 0.8% | |
| 35 | 11 | 0.8% | |
| 32 | 11 | 0.8% | |
| 75 | 10 | 0.7% | |
| 42 | 10 | 0.7% | |
| 120 | 10 | 0.7% | |
| 96 | 10 | 0.7% | |
| 64 | 9 | 0.6% | |
| 66 | 9 | 0.6% | |
| Other values (177) | 466 | 31.9% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 656 | 44.9% | |
| 4 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 12 | 3 | 0.2% | |
| 15 | 1 | 0.1% | |
| 16 | 8 | 0.5% | |
| 17 | 2 | 0.1% | |
| 18 | 5 | 0.3% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 547 | 1 | 0.1% | |
| 523 | 1 | 0.1% | |
| 502 | 1 | 0.1% | |
| 418 | 1 | 0.1% | |
| 406 | 1 | 0.1% | |
| 364 | 1 | 0.1% | |
| 341 | 1 | 0.1% | |
| 319 | 1 | 0.1% | |
| 312 | 2 | 0.1% | |
| 304 | 1 | 0.1% |
EnclosedPorch
Real number (ℝ≥0)
| Distinct | 120 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.95410959 |
| Minimum | 0 |
| Maximum | 552 |
| Zeros | 1252 |
| Zeros (%) | 85.8% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 180.15 |
| Maximum | 552 |
| Range | 552 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 61.1191486 |
|---|---|
| Coefficient of variation (CV) | 2.783950237 |
| Kurtosis | 10.43076594 |
| Mean | 21.95410959 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.089871904 |
| Sum | 32053 |
| Variance | 3735.550326 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1252 | 85.8% | |
| 112 | 15 | 1.0% | |
| 96 | 6 | 0.4% | |
| 120 | 5 | 0.3% | |
| 144 | 5 | 0.3% | |
| 192 | 5 | 0.3% | |
| 216 | 5 | 0.3% | |
| 252 | 4 | 0.3% | |
| 116 | 4 | 0.3% | |
| 156 | 4 | 0.3% | |
| 126 | 3 | 0.2% | |
| 228 | 3 | 0.2% | |
| 128 | 3 | 0.2% | |
| 184 | 3 | 0.2% | |
| 102 | 3 | 0.2% | |
| 150 | 3 | 0.2% | |
| 40 | 3 | 0.2% | |
| 176 | 3 | 0.2% | |
| 164 | 3 | 0.2% | |
| 77 | 2 | 0.1% | |
| 185 | 2 | 0.1% | |
| 80 | 2 | 0.1% | |
| 180 | 2 | 0.1% | |
| 84 | 2 | 0.1% | |
| 160 | 2 | 0.1% | |
| Other values (95) | 116 | 7.9% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1252 | 85.8% | |
| 19 | 1 | 0.1% | |
| 20 | 1 | 0.1% | |
| 24 | 1 | 0.1% | |
| 30 | 1 | 0.1% | |
| 32 | 2 | 0.1% | |
| 34 | 2 | 0.1% | |
| 36 | 2 | 0.1% | |
| 37 | 1 | 0.1% | |
| 39 | 2 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 552 | 1 | 0.1% | |
| 386 | 1 | 0.1% | |
| 330 | 1 | 0.1% | |
| 318 | 1 | 0.1% | |
| 301 | 1 | 0.1% | |
| 294 | 1 | 0.1% | |
| 293 | 1 | 0.1% | |
| 291 | 1 | 0.1% | |
| 286 | 1 | 0.1% | |
| 280 | 1 | 0.1% |
3SsnPorch
Real number (ℝ≥0)
| Distinct | 20 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.409589041 |
| Minimum | 0 |
| Maximum | 508 |
| Zeros | 1436 |
| Zeros (%) | 98.4% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 508 |
| Range | 508 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 29.31733056 |
|---|---|
| Coefficient of variation (CV) | 8.598493896 |
| Kurtosis | 123.6623794 |
| Mean | 3.409589041 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.30434203 |
| Sum | 4978 |
| Variance | 859.505871 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=20)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1436 | 98.4% | |
| 168 | 3 | 0.2% | |
| 216 | 2 | 0.1% | |
| 144 | 2 | 0.1% | |
| 180 | 2 | 0.1% | |
| 245 | 1 | 0.1% | |
| 238 | 1 | 0.1% | |
| 290 | 1 | 0.1% | |
| 196 | 1 | 0.1% | |
| 182 | 1 | 0.1% | |
| 407 | 1 | 0.1% | |
| 304 | 1 | 0.1% | |
| 162 | 1 | 0.1% | |
| 153 | 1 | 0.1% | |
| 320 | 1 | 0.1% | |
| 140 | 1 | 0.1% | |
| 130 | 1 | 0.1% | |
| 96 | 1 | 0.1% | |
| 23 | 1 | 0.1% | |
| 508 | 1 | 0.1% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1436 | 98.4% | |
| 23 | 1 | 0.1% | |
| 96 | 1 | 0.1% | |
| 130 | 1 | 0.1% | |
| 140 | 1 | 0.1% | |
| 144 | 2 | 0.1% | |
| 153 | 1 | 0.1% | |
| 162 | 1 | 0.1% | |
| 168 | 3 | 0.2% | |
| 180 | 2 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 508 | 1 | 0.1% | |
| 407 | 1 | 0.1% | |
| 320 | 1 | 0.1% | |
| 304 | 1 | 0.1% | |
| 290 | 1 | 0.1% | |
| 245 | 1 | 0.1% | |
| 238 | 1 | 0.1% | |
| 216 | 2 | 0.1% | |
| 196 | 1 | 0.1% | |
| 182 | 1 | 0.1% |
ScreenPorch
Real number (ℝ≥0)
| Distinct | 76 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.0609589 |
| Minimum | 0 |
| Maximum | 480 |
| Zeros | 1344 |
| Zeros (%) | 92.1% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 160 |
| Maximum | 480 |
| Range | 480 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 55.75741528 |
|---|---|
| Coefficient of variation (CV) | 3.70211589 |
| Kurtosis | 18.43906784 |
| Mean | 15.0609589 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.122213743 |
| Sum | 21989 |
| Variance | 3108.889359 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1344 | 92.1% | |
| 192 | 6 | 0.4% | |
| 224 | 5 | 0.3% | |
| 120 | 5 | 0.3% | |
| 189 | 4 | 0.3% | |
| 180 | 4 | 0.3% | |
| 160 | 3 | 0.2% | |
| 168 | 3 | 0.2% | |
| 144 | 3 | 0.2% | |
| 126 | 3 | 0.2% | |
| 147 | 3 | 0.2% | |
| 90 | 3 | 0.2% | |
| 200 | 2 | 0.1% | |
| 198 | 2 | 0.1% | |
| 216 | 2 | 0.1% | |
| 184 | 2 | 0.1% | |
| 259 | 2 | 0.1% | |
| 100 | 2 | 0.1% | |
| 176 | 2 | 0.1% | |
| 170 | 2 | 0.1% | |
| 288 | 2 | 0.1% | |
| 142 | 2 | 0.1% | |
| 153 | 1 | 0.1% | |
| 154 | 1 | 0.1% | |
| 152 | 1 | 0.1% | |
| Other values (51) | 51 | 3.5% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1344 | 92.1% | |
| 40 | 1 | 0.1% | |
| 53 | 1 | 0.1% | |
| 60 | 1 | 0.1% | |
| 63 | 1 | 0.1% | |
| 80 | 1 | 0.1% | |
| 90 | 3 | 0.2% | |
| 95 | 1 | 0.1% | |
| 99 | 1 | 0.1% | |
| 100 | 2 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 480 | 1 | 0.1% | |
| 440 | 1 | 0.1% | |
| 410 | 1 | 0.1% | |
| 396 | 1 | 0.1% | |
| 385 | 1 | 0.1% | |
| 374 | 1 | 0.1% | |
| 322 | 1 | 0.1% | |
| 312 | 1 | 0.1% | |
| 291 | 1 | 0.1% | |
| 288 | 2 | 0.1% |
PoolArea
Real number (ℝ≥0)
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.75890411 |
| Minimum | 0 |
| Maximum | 738 |
| Zeros | 1453 |
| Zeros (%) | 99.5% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 738 |
| Range | 738 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 40.17730694 |
|---|---|
| Coefficient of variation (CV) | 14.56277759 |
| Kurtosis | 223.2684989 |
| Mean | 2.75890411 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 14.82837364 |
| Sum | 4028 |
| Variance | 1614.215993 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1453 | 99.5% | |
| 738 | 1 | 0.1% | |
| 648 | 1 | 0.1% | |
| 576 | 1 | 0.1% | |
| 555 | 1 | 0.1% | |
| 519 | 1 | 0.1% | |
| 512 | 1 | 0.1% | |
| 480 | 1 | 0.1% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1453 | 99.5% | |
| 480 | 1 | 0.1% | |
| 512 | 1 | 0.1% | |
| 519 | 1 | 0.1% | |
| 555 | 1 | 0.1% | |
| 576 | 1 | 0.1% | |
| 648 | 1 | 0.1% | |
| 738 | 1 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 738 | 1 | 0.1% | |
| 648 | 1 | 0.1% | |
| 576 | 1 | 0.1% | |
| 555 | 1 | 0.1% | |
| 519 | 1 | 0.1% | |
| 512 | 1 | 0.1% | |
| 480 | 1 | 0.1% | |
| 0 | 1453 | 99.5% |
MiscVal
Real number (ℝ≥0)
| Distinct | 21 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.4890411 |
| Minimum | 0 |
| Maximum | 15500 |
| Zeros | 1408 |
| Zeros (%) | 96.4% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 15500 |
| Range | 15500 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 496.1230245 |
|---|---|
| Coefficient of variation (CV) | 11.408001 |
| Kurtosis | 701.0033423 |
| Mean | 43.4890411 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.47679419 |
| Sum | 63494 |
| Variance | 246138.0554 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=21)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1408 | 96.4% | |
| 400 | 11 | 0.8% | |
| 500 | 8 | 0.5% | |
| 700 | 5 | 0.3% | |
| 450 | 4 | 0.3% | |
| 2000 | 4 | 0.3% | |
| 600 | 4 | 0.3% | |
| 1200 | 2 | 0.1% | |
| 480 | 2 | 0.1% | |
| 1150 | 1 | 0.1% | |
| 800 | 1 | 0.1% | |
| 15500 | 1 | 0.1% | |
| 620 | 1 | 0.1% | |
| 3500 | 1 | 0.1% | |
| 560 | 1 | 0.1% | |
| 2500 | 1 | 0.1% | |
| 1300 | 1 | 0.1% | |
| 1400 | 1 | 0.1% | |
| 350 | 1 | 0.1% | |
| 8300 | 1 | 0.1% | |
| 54 | 1 | 0.1% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1408 | 96.4% | |
| 54 | 1 | 0.1% | |
| 350 | 1 | 0.1% | |
| 400 | 11 | 0.8% | |
| 450 | 4 | 0.3% | |
| 480 | 2 | 0.1% | |
| 500 | 8 | 0.5% | |
| 560 | 1 | 0.1% | |
| 600 | 4 | 0.3% | |
| 620 | 1 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 15500 | 1 | 0.1% | |
| 8300 | 1 | 0.1% | |
| 3500 | 1 | 0.1% | |
| 2500 | 1 | 0.1% | |
| 2000 | 4 | 0.3% | |
| 1400 | 1 | 0.1% | |
| 1300 | 1 | 0.1% | |
| 1200 | 2 | 0.1% | |
| 1150 | 1 | 0.1% | |
| 800 | 1 | 0.1% |
SalePrice
Real number (ℝ≥0)
| Distinct | 663 |
|---|---|
| Distinct (%) | 45.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180921.1959 |
| Minimum | 34900 |
| Maximum | 755000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 11.5 KiB |
Quantile statistics
| Minimum | 34900 |
|---|---|
| 5-th percentile | 88000 |
| Q1 | 129975 |
| median | 163000 |
| Q3 | 214000 |
| 95-th percentile | 326100 |
| Maximum | 755000 |
| Range | 720100 |
| Interquartile range (IQR) | 84025 |
Descriptive statistics
| Standard deviation | 79442.50288 |
|---|---|
| Coefficient of variation (CV) | 0.4391000319 |
| Kurtosis | 6.53628186 |
| Mean | 180921.1959 |
| Median Absolute Deviation (MAD) | 38000 |
| Skewness | 1.88287576 |
| Sum | 264144946 |
| Variance | 6311111264 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 140000 | 20 | 1.4% | |
| 135000 | 17 | 1.2% | |
| 145000 | 14 | 1.0% | |
| 155000 | 14 | 1.0% | |
| 190000 | 13 | 0.9% | |
| 110000 | 13 | 0.9% | |
| 160000 | 12 | 0.8% | |
| 115000 | 12 | 0.8% | |
| 139000 | 11 | 0.8% | |
| 130000 | 11 | 0.8% | |
| 125000 | 10 | 0.7% | |
| 143000 | 10 | 0.7% | |
| 185000 | 10 | 0.7% | |
| 180000 | 10 | 0.7% | |
| 144000 | 10 | 0.7% | |
| 175000 | 9 | 0.6% | |
| 147000 | 9 | 0.6% | |
| 100000 | 9 | 0.6% | |
| 127000 | 9 | 0.6% | |
| 165000 | 8 | 0.5% | |
| 176000 | 8 | 0.5% | |
| 170000 | 8 | 0.5% | |
| 129000 | 8 | 0.5% | |
| 230000 | 8 | 0.5% | |
| 250000 | 8 | 0.5% | |
| Other values (638) | 1189 | 81.4% |
Smallest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 34900 | 1 | 0.1% | |
| 35311 | 1 | 0.1% | |
| 37900 | 1 | 0.1% | |
| 39300 | 1 | 0.1% | |
| 40000 | 1 | 0.1% | |
| 52000 | 1 | 0.1% | |
| 52500 | 1 | 0.1% | |
| 55000 | 2 | 0.1% | |
| 55993 | 1 | 0.1% | |
| 58500 | 1 | 0.1% |
Largest values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 755000 | 1 | 0.1% | |
| 745000 | 1 | 0.1% | |
| 625000 | 1 | 0.1% | |
| 611657 | 1 | 0.1% | |
| 582933 | 1 | 0.1% | |
| 556581 | 1 | 0.1% | |
| 555000 | 1 | 0.1% | |
| 538000 | 1 | 0.1% | |
| 501837 | 1 | 0.1% | |
| 485000 | 1 | 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the steepness of the line does not affect r (only the sign of the slope does). To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
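The calculation described above can be sketched in a few lines of plain Python. The function name `pearson_r` is a hypothetical helper, not part of the profiling tool. The inputs are the GrLivArea and SalePrice values of the first five rows of this dataset, used only for illustration, so the result says nothing about the full columns.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# GrLivArea and SalePrice for rows 0-4 of the dataset (illustration only).
gr_liv_area = [1710, 1262, 1786, 1717, 2198]
sale_price = [208500, 181500, 223500, 140000, 250000]

r_first_rows = pearson_r(gr_liv_area, sale_price)

# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r(gr_liv_area, [0.5 * p + 1000 for p in sale_price])

print(r_first_rows, r_rescaled)
```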
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
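A minimal sketch of this definition: rank both variables, then apply the Pearson formula to the ranks. Tied values receive the average of their ranks, which is a common convention and an assumption here. The helpers `average_ranks` and `spearman_rho` are hypothetical names, and the data is a toy example.

```python
import math


def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Pearson correlation computed on the rank variables of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# A nonlinear but strictly increasing relationship: the ranks agree perfectly.
rho_monotonic = spearman_rho([1, 2, 3, 4], [1, 4, 9, 16])
print(rho_monotonic)
```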
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
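The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch on toy data follows, with `kendall_tau_a` as a hypothetical helper name. Tied pairs count as neither concordant nor discordant here. Statistical libraries often default to the tie-adjusted tau-b, so their results can differ when ties are present.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # The sign of the product tells whether x and y move in the same direction.
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Two of the three pairs are concordant and one is discordant.
tau_example = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau_example)
```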
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | LotArea | 1stFlrSF | 2ndFlrSF | LowQualFinSF | GrLivArea | WoodDeckSF | OpenPorchSF | EnclosedPorch | 3SsnPorch | ScreenPorch | PoolArea | MiscVal | SalePrice |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 8450 | 856 | 854 | 0 | 1710 | 0 | 61 | 0 | 0 | 0 | 0 | 0 | 208500 |
| 1 | 9600 | 1262 | 0 | 0 | 1262 | 298 | 0 | 0 | 0 | 0 | 0 | 0 | 181500 |
| 2 | 11250 | 920 | 866 | 0 | 1786 | 0 | 42 | 0 | 0 | 0 | 0 | 0 | 223500 |
| 3 | 9550 | 961 | 756 | 0 | 1717 | 0 | 35 | 272 | 0 | 0 | 0 | 0 | 140000 |
| 4 | 14260 | 1145 | 1053 | 0 | 2198 | 192 | 84 | 0 | 0 | 0 | 0 | 0 | 250000 |
| 5 | 14115 | 796 | 566 | 0 | 1362 | 40 | 30 | 0 | 320 | 0 | 0 | 700 | 143000 |
| 6 | 10084 | 1694 | 0 | 0 | 1694 | 255 | 57 | 0 | 0 | 0 | 0 | 0 | 307000 |
| 7 | 10382 | 1107 | 983 | 0 | 2090 | 235 | 204 | 228 | 0 | 0 | 0 | 350 | 200000 |
| 8 | 6120 | 1022 | 752 | 0 | 1774 | 90 | 0 | 205 | 0 | 0 | 0 | 0 | 129900 |
| 9 | 7420 | 1077 | 0 | 0 | 1077 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 118000 |
Last rows
| | LotArea | 1stFlrSF | 2ndFlrSF | LowQualFinSF | GrLivArea | WoodDeckSF | OpenPorchSF | EnclosedPorch | 3SsnPorch | ScreenPorch | PoolArea | MiscVal | SalePrice |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1450 | 9000 | 896 | 896 | 0 | 1792 | 32 | 45 | 0 | 0 | 0 | 0 | 0 | 136000 |
| 1451 | 9262 | 1578 | 0 | 0 | 1578 | 0 | 36 | 0 | 0 | 0 | 0 | 0 | 287090 |
| 1452 | 3675 | 1072 | 0 | 0 | 1072 | 0 | 28 | 0 | 0 | 0 | 0 | 0 | 145000 |
| 1453 | 17217 | 1140 | 0 | 0 | 1140 | 36 | 56 | 0 | 0 | 0 | 0 | 0 | 84500 |
| 1454 | 7500 | 1221 | 0 | 0 | 1221 | 0 | 113 | 0 | 0 | 0 | 0 | 0 | 185000 |
| 1455 | 7917 | 953 | 694 | 0 | 1647 | 0 | 40 | 0 | 0 | 0 | 0 | 0 | 175000 |
| 1456 | 13175 | 2073 | 0 | 0 | 2073 | 349 | 0 | 0 | 0 | 0 | 0 | 0 | 210000 |
| 1457 | 9042 | 1188 | 1152 | 0 | 2340 | 0 | 60 | 0 | 0 | 0 | 0 | 2500 | 266500 |
| 1458 | 9717 | 1078 | 0 | 0 | 1078 | 366 | 0 | 112 | 0 | 0 | 0 | 0 | 142125 |
| 1459 | 9937 | 1256 | 0 | 0 | 1256 | 736 | 68 | 0 | 0 | 0 | 0 | 0 | 147500 |
Duplicate rows
Most frequent
| | LotArea | 1stFlrSF | 2ndFlrSF | LowQualFinSF | GrLivArea | WoodDeckSF | OpenPorchSF | EnclosedPorch | 3SsnPorch | ScreenPorch | PoolArea | MiscVal | SalePrice | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2522 | 970 | 739 | 0 | 1709 | 0 | 40 | 0 | 0 | 0 | 0 | 0 | 130000 | 2 |